empty image
PARDINUS: Weakly supervised discarding of photo-trapping empty images based on autoencoders
de la Rosa, David, Rivera, Antonio J, del Jesus, María J, Charte, Francisco
Photo-trapping cameras are widely employed for wildlife monitoring. Those cameras take photographs when motion is detected to capture images where animals appear. A significant portion of these images are empty - no wildlife appears in the image. Filtering out those images is not a trivial task since it requires hours of manual work from biologists. Therefore, there is a notable interest in automating this task. Automatic discarding of empty photo-trapping images is still an open field in the area of Machine Learning. Existing solutions often rely on state-of-the-art supervised convolutional neural networks that require the annotation of the images in the training phase. PARDINUS (Weakly suPervised discARDINg of photo-trapping empty images based on aUtoencoderS) is constructed on the foundation of weakly supervised learning and proves that this approach equals or even surpasses other fully supervised methods that require further labeling work.
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- (12 more...)
Contrastive Representation Learning with Trainable Augmentation Channel
Koyama, Masanori, Minami, Kentaro, Miyato, Takeru, Gal, Yarin
In contrastive representation learning, data representation is trained so that it can classify the image instances even when the images are altered by augmentations. However, depending on the datasets, some augmentations can damage the information of the images beyond recognition, and such augmentations can result in collapsed representations. We present a partial solution to this problem by formalizing a stochastic encoding process in which there exist a tug-of-war between the data corruption introduced by the augmentations and the information preserved by the encoder. We show that, with the infoMax objective based on this framework, we can learn a data-dependent distribution of augmentations to avoid the collapse of the representation.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- North America > United States > Colorado (0.04)
Citizen science projects have a surprising new partner--the computer
For more than a decade, citizen science projects have helped researchers use the power of thousands of volunteers who help sort through datasets that are too large for a small research team. Previously, this data generally couldn't be processed by computers because the work required skills that only humans could accomplish. Now, computer machine learning techniques that teach the computer specific image recognition skills can be used in crowdsourcing projects to deal with massively increasing amounts of data--making computers a surprising new partner in citizen science projects. The research, led by the University of Minnesota-Twin Cities, was chosen as the cover story for the most recent issue of the British Ecological Society's scientific journal Methods in Ecology and Evolution. These camera traps are remote, independent devices, triggered by motion and infrared sensors that provide researchers with images of passing animals.
- North America > United States > Minnesota (0.30)
- North America > United States > Wisconsin (0.06)
- Africa (0.06)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)